An Analysis of Author Contribution Patterns in Eclipse Foundation Project Source Code
نویسندگان
چکیده
Collaborative development is a key tenet of open source software, but if not properly understood and managed, it can become a liability. We examine author contribution data for the newest revision of 251,633 Java source files in 592 Eclipse projects. We use this observational data to analyze collaboration patterns within files, and to explore relationships between file size, author count, and code authorship. We calculate author entropy to characterize the contributions of multiple authors to a given file, with an eye toward understanding the degree of collaboration and the most common interaction patterns.
منابع مشابه
An Analysis of the Effect of Code Ownership on Software Quality across Windows, Eclipse, and Firefox
Ownership is an important aspect of software development in large software teams. We examine the relationship between a number of ownership measures and software faults/failures in three large software projects drawn from different process domains: Windows Vista, the Eclipse Java IDE, and the Firefox Web Browser. We find that in all cases, measures based on ownership levels, such as the number ...
متن کاملLocating Matching Method Calls by Mining Revision History Data
Developing an appropriate fix for a software bug often requires a detailed examination of the code as well as generation of appropriate test cases. However, certain categories of bugs are usually easy to fix. In this paper we focus on bugs that can be corrected with a one-line code change. As it turns out, one-line source code changes very often represent bug fixes. Moreover, a significant frac...
متن کاملParichayana: An Eclipse Plugin for Detecting Exception Handling Anti-Patterns and Code Smells in Java Programs
Anti-patterns and code-smells are signs in the source code which are not defects (does not prevent the program from functioning and does not cause compile errors) and are rather indicators of deeper and bigger problems. Exception handling is a programming construct designed to handle the occurrence of anomalous or exceptional conditions (that changes the normal flow of program execution). In th...
متن کاملThe nature and composition of the Linux kernel developer community: a dynamic analysis
This paper presents data gathered from a detailed study of the structure and composition of the Linux kernel developer community, as sampled through three versions of the Linux kernel. Based on previously defined methodology and tools, data is presented on the distribution of authorship across modules, the degree of collaboration between authors, and the distribution, structure and inter-depend...
متن کاملCodeQuest - Source Code Querying with Datalog
Understanding source code is vital to many tasks in software engineering. Source code querying tools are designed to help such understanding, by allowing programmers to explore relations that exist between different parts of the codebase. The contribution of such a system named CodeQuest is the topic of this dissertation. One of the modern source code querying and browsing tools for Java is JQu...
متن کامل